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provides a way to systematically construct a local mean force in order to optimally 
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the approximation of the potential of mean force under both linear and non-linear 
coarse graining mappings (e.g., reaction coordinates, end-to-end length of chains). 
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examples in molecular systems. 
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I. INTRODUCTION 


Complex molecular systems are materials of amazing diversity ranging from polymers 
to colloids, hybrid nanocomposites, biomolecular systems, etc, which are directly related 
with an enormous range of possible applications in nano-, bio-technology, food science, 
drug industry, cosmetics etc. Due to the above reasons molecular simulations of complex 
systems is a very intense research area.^ A main challenge in this held is to predict structure- 
properties relations of such materials and to provide a direct quantitative link between 
chemical structure at the molecular level and measurable structural and dynamical quantities 
over a broad range of length and time scales. 

On the microscopic (atomistic) level, detailed all-atom molecular dynamics (MD), or 
Monte Carlo (MC) simulations allow direct quantitative predictions of the properties of 
molecular systems over a range of length and time scales.^i® However, due to the broad 
spectrum of characteristic lengths and times involved in complex molecular systems it is 
not feasible to apply them to large realistic systems or molecules of complex structure, such 
as multi-component biomaterials, polymers of high molecular weight, colloids etc. On the 
mesoscopic level, coarse-grained (CG) models have proven to be very efficient means in order 
to increase the length and time scales accessible by simulations.^^^^^ 

CG (particle) models can be roughly categorized, based on the way they are developed, 
into two groups: (a) Ad hoc or phenomenological CG models, such as simple bead spring or 
lattice ones, which are primarily used to study generic behavior (e.g. scaling properties) of 
complex systems but lack a link to specihc systems.^ The interactions between the CG groups 
in these models are described through semi-empirical functional forms obtained through 
previous knowledge and with a lot of physical intuition, (b) Systematic CG models, which 
are usually developed by lumping groups of atoms into groups, i.e. ’’superatoms”, and 
deriving the effective GG interaction potentials directly from more detailed (microscopic) 
simulations. Such models are capable of predicting quantitatively the properties of specific 
systems and have been applied with great success to a very broad range of molecular systems 
(see for example refs.®®®^and references within). 

A main challenge in the later family of GG models is to develop rigorous atomistic to 
GG methodologies that allow, as accurate as possible, the estimation of the GG effective 
interaction. With such approaches the hierarchical combination of atomistic and GG models 
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could be in order to study a very broad range of length and time scales of specific molecular 
complex systems without adjustable parameters, and by that become truly predictivej ^° * ^^ * ^^ ^ 
Let us assume a specihc molecular system. The overall procedure of systematic coarse¬ 
grained modeling for this system, based on detailed microscopic data, is shortly described 
through the following stages: (a) Execution of microscopic (e.g. ab-initio or atomistic) 
simulations on small model systems, i.e. usually a relatively small number of molecules with 
a rather low molecular weight is considered, (b) Choose of the CG map (transformation 
from the atomistic to the CG description), (c) Development of the CG effective interaction 
potential (force held), (d) Execution of the GG (e.g. MD, Langevin dynamics, LD, or 
MG) simulations and (e) Re-introduction of the missing atomistic degrees of freedom in the 
GG structures, in case the properties under study require atomistic detail. From all above 
stages the development of the GG force held is the most challenging one. Indeed, an accurate 
estimation of the way GG ’’superatoms” interact to each other is a conditio sine qua non 
in order to understand the behavior and to (quantitatively) predict the properties of the 
specihc complex molecular system under study. 

Note that from a mathematical point of view coarse-graining is a sub-held of the dimen¬ 
sionality reduction.^ Indeed, there are several statistical methods for the reduction of the 
degrees of freedom under consideration, in a deterministic or stochastic model, such as prin¬ 
cipal component analysis and dihusion maps.^Here we focus our discussion on GG methods 
based on statistical mechanics, which are used extensively the last two-three decades in the 
theoretical modeling of molecular systems across a very broad range of disciplines, from 
physics to chemistry and biology as well as in engineering sciences. 

There exists a variety of methods that construct a reduced model that approximates 
ehective properties of complex systems based on statistical mechanics. These methods usu¬ 
ally consider the optimization of proposed parametric models using diherent minimization 
principles, that is considering a pre-selected set of observables z = 1,..., fc} and then 
minimizing over a parameter set 0, 

k 

i=l 

where are the atomistic and proposed Gibbs measures respectively. Different 

methods consider different sets of observables. For example: 

(a) In structural based methods the observable is the pair radial distribution function g{r), 
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related to the two-body potential of mean force (see section VIC), for the intermolecular 
interaction potential, and distribution functions of bonded degrees of freedom (e.g. bonds, 
angles, dihedrals) for CG systems with intramolecular interaction potential P * ^ * ^ * ^°*^n 

(b) Force matching (FM) method^^^^^^ consider as observable function the force fj{x) = 
—VxjU{x), j = 1,..., N, for an iV-particle system with interaction potential U{x), x G . 

(c) The relative entropy (RE)Pi2^ method employs the minimization of the relative entropy 
pseudo-distance 

dfi{x) 


Kpll/) = 


log 


djjP^x 


-djjL{x ), 


These methods, in principle, are employed to approximate a many body potential describing 
the equilibrium distribution of CG particles observed in simulations of atomically detailed 
models. The many body potential is dehned through the renormalization group mapP^ that 
is equivalent to the potential of mean force (PMF)PSI in case the former is differentiable. 
The force-matching (or multi scale coarse graining (MSCG) ) and the relative entropy are 
minimization methods that construct a best £t of a proposed coarse graining potential 
for systems in equilibrium. The force-matching method determines a CG potential from 
atomistic force information through a least-square minimization principle, to variationally 
project the force corresponding to the potential of mean force onto a force that is dehned 
by the form of the approximate potential. The relative entropy approach obtains optimal 
CG potential parameters by minimizing the relative entropy between the atomistic and 
the CG Gibbs measures sampled by the atomistic. A brief review and categorization of 
parametrization methods in equilibrium is given in ref.^ 

Besides all the above, a classical method for calculating free energy differences using ar¬ 
bitrary reaction coordinates is thermodynamic integration (TI) theory.®® Thermodynamic 
integration is based on writing free energy differences as the integral of free energy derivative 
and thus computing the derivatives (mean force) instead of directly the free energy. 

The purpose of this work is: (a) To reformulate in the probabilistic language of conditional 
expectations the force matching method. In turn, the conditional expectation formulation 
allows us: (b) To reveal the connection of force matching with thermodynamic integration 
that provides a way to construct a local mean force in order to best approximate the potential 
of mean force when applying the force matching method, (c) To present in a probabilistic 
formalism the equivalence of relative entropy and force matching methods which we derive 
for general nonlinear coarse graining maps. We furthermore discuss structure-based (SB) GG 
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methods thus presenting a complete picture of the known many body potential estimation 
methods for systems at equilibrium and their relation. 

Furthermore, the probabilistic formalism gives a geometric representation of the force 
matching method, i.e. recast the force matching as a projection procedure onto the space of 
coarse obsvervables (we refer specihcally to Figure]^. The novelty and advantages of our 
approach is that it allows us to dehne a generalized force matching minimization problem 


min [II IP] 

Cz 

applicable for linear and nonlinear CG maps ^ —)■ The force matching condition 

introduced 

h{x) = 3r\x)Bax)f{x) + ■ Jr'(a:)De(a;), (1) 

where J^(x) = D^(x)D^*(a:),D^ e i = 1,... ,m, j = 1,..., N 

ensures the best approximation of the PMF. As in thermodynamic integration h{x) is called 
the local mean force. A more general result is available in Theorem This elucidates 
the direct connection of the above discussed particle CG methodologies with the standard 
thermodynamic integration approaches. 

The current work is directly related to previous works that concern linear CG mapping 
schemes.^^^^ Here we recast and extend these works in a probabilistic formalism in order 
to present and compare the relative entropy and force matching methods that allows us to 
generalize the methodology to nonlinear coarse-graining maps. In the case of a linear CG 
map the local mean force is 

h{x) = 3C^{x)'D^{x)f{x) 


which reduces to the result in ref.^, see the examples in Section V A Notice that for linear 
CG maps the last term in relation ([^ vanishes. The proposed formula for the local mean 
force extends the work^^^El aspects: (a) to any non-linear CG map and (b) the 

existence of a family of appropriate local mean forces h{x). 

Finally, we should note that in the above discussion we focus on molecular systems at 
equilibrium. The study of non-equilibrium systems is an even more challenging area related 
to various phenomena such as the response of the molecular systems on external stimuli (e.g. 
rheological properties or mechanical behavior of composites systems). The development of 
GG force helds for systems under non-equilibrium conditions, based on the information 
theory and path-space tools of relative entropy is the subject of ref.®^. 
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The structure of this work is as follows. In Section[TTlwe introduce the atomistic molecular 
system and its coarse graining through the dehnition of the CG map and the n-body potential 
of mean force. The probabilistic formulation of the force matching method and the best 


approximation of the PMF are presented in Section III, while the force matching condition 


for approximating the PMF for any, linear or non-linear CG map, are given in Section IV 


In Sections V A and VB we calculate the analytic form of the local mean force for examples 
of linear and non-linear CG maps in molecular systems. A second result of the current 


work is presented in Section VI where we prove that the relative entropy minimization and 
the force matching methods are equivalent, producing the same approximation to PMF up 
to a constant. Furthermore, for completeness we present the structure based methods in 
Section VIC We close with Section |VTT summarizing and discussing the results of this work. 


II. ATOMISTIC AND COARSE-GRAINED SYSTEMS 

Assume the prototypical problem of N (classical) molecules in a box of volume V at 
temperature T. Let x = {xi,... ,xn) ^ describe the position vectors of the N particles 
in the atomistic (microscopic) description, with potential energy U{x). The probability of 
a state x at temperature T is given by the Gibbs canonical measure 

fi{dx) = Z~^ exp{—f3U{x)}dx , 

where Z = e~^^^^'>dx is the partition function, [3 = and ks the Boltzmann constant. 

We denote f{x) the force corresponding to the potential U{x) that is / : —?• 

/.(x) =-V.,.t/(a;), j = l,...,A. (2) 

For such a system the u-body, n < N, potential of mean force (PMF) ..., Xn^^^ 

is dehned through 

t/™^(xi,...,x„) = -^log^(”^(a;i,...,x„), 

where ..., Xn) is the u-body distribution function 

M N\ f 

g^'^>{xi,...,Xn) = ^ / fi{x)dxn+i.. .dXN , 

and p = ^ is the number density. 
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Coarse-graining is considered as the application of a mapping (CG mapping) ^ —)■ 

^3M 


X H-)■ G 


pSM 


on the microscopic state space, determining the M(< N) CG particles as a function of the 
atomic conhguration x. We denote by 2 ; = (^i,..., zm) any point in the CG conhguration 
space and use the ^ notation for quantities on the CG space. We call ’particles’ the 

elements of the microscopic space with positions Xj G j = 1,... ,iV and ’CG particles’ 
the elements of the coarse space with positions Zt G i = 1,..., M. We should note that 
a CG mapping ^(x) does not necessarily maps to three-dimensional CG particles, it can be 
considered in the more general form ^ —)■ M™, for any m G N with m < 3N. This is 

the case, for example, when considering some reaction coordinates, like the bending angle 


and the end-to-end distance (see examples in Section VB). 


The conditional Helmholtz free energy A{z) related to the CG mapping ^(x), dehned 
by the renormalization group mapP^, is based on the property that for any observable 0 : 
—)■ M of the form (j){x) = (j){^{x)) it holds 

~ / 0(^(x))/i(x)(ix = / / (j){z)fi{x)dxdz 

jRSiV J^3M J{j,. g(a;)=2} 


/r3M 


(j){z)fi{z)dz = E^[E^[(/)|^]], 


where E^[(/)] denotes the expectation of 0(x) with respect to the probability measure fi{dx), 


h(^) = / /i(x)(ix, G(2;) = {x G : .^(x) = ^} , 

Jn{z) 


( 3 ) 


and E„ [01^] is dehned by 


E^ [4>\z] := E^ [ 01 ^ = z] = / (j){x)fi{x)dx, for all 2: G 

Jn{z) 




( 4 ) 


being the conditional expectation of the observable quantity 0(x) that represents the ex¬ 
pectation of 0(x) with respect to the Gibbs measure fi{dx) for given 2; = .^(x) hxed. For a 
complete mathematical formulation of conditional expectation see Appendix The condi¬ 
tional Helmholtz free energy A(z) is thus dehned such that the CG probability density fi{z) 
is of Gibbs type, i.e. 

A{z) =-^ log fi{z) - ^ log Z . (5) 
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The conditional potential of mean force (PMF) is directly related to the free 

energy through the reversible work theoreml ^^ * ^^ ! In many works the free energy and potential 
of mean force are used interchangeably. Here we use the potential of mean force notation 
and write = A(z). We define the mean force —)■ corresponding to 

the PMF dehned by (|^, assuming it exists, 

z = l,...,M. ( 6 ) 

The calculation of the potential of the mean force is a task as difficult/costly as is calculating 
expectations on the microscopic space. Thus one seeks an effective potential function U{z), 
that approximates as well as possible the PMF, and is easy to formulate and calculate. 
This is the ultimate goal of all the methods (i.e., structural-based methods, force matching, 
relative entropy minimization) for systems in equilibrium, that we present in the following 
sections in detail. In all the above mentioned methods, one usually proposes a family of 
interaction potential functions U{z) in a parametrized, U{z]6), 6^ G 0, or a functional form 
U{z) and seeks for the optimal U*{z) that ’best approximates’ the PMF. We denote by 

ft(j{dz) = Z~^ exp{—f3U{z)}dz , 

the equilibrium probability measure at the coarse grained configurational space for the given 
CG potential function U{z), where Z = J e~^^^^'^dz is the CG partition function. 

III. CONDITIONAL EXPECTATION AND FORCE MATCHING 

Force matching is based on the observation of a vector field h : —)■ 

X ^ h{x) e , (7) 

from microscopic simulations or experimental observations, and the definition of an opti¬ 
mization problem in order to find an optimal estimator G*{z) of h{x) as a function of con¬ 
figurations in the coarse space The optimization problem is to find a G* : —)• 

such that the mean square error 

£(G;A)=E, [||h-GK)f] = / \\h(x) - a{((x))rMx)dx, (8) 

is minimized, where || ■ || denotes the Euclidean norm in In general h{x) can be 

any observable quantity which, eventually, for the force matching methocP that we study 




represents the atomistic force on the coarse particles in conhguration x E Hence, the 

force matching method at equilibrium is seeking for the optimal force G* (z) as the minimizer 
of the mean square error £(G; h) over a set of proposed CG forces G{z) at the coarse space. 

We recast the force matching optimization problem, as proposed irP^, in probabilistic 
terms using the concept of conditional expectation and its interpretation as a projection on 
a subspace of observables. First, we present a well-known result in probability theory, see 
ref.l^. We include the proof in Appendix for completeness. We denote by h^(/i) the space 
of mean square integrable vector fields with respect to i.e. h^(/i) = {h : —)■ 

/jg 3 jv \\h{x)\\‘^ix{dx) < cxd}. For a given CG map ^ we denote 

= {g ^ L^(/r)|there exists G : R^^ -E R^^ such that g = G o 

the space of observables having the properties: (i) Are square integrable observables with 
respect to the Gibbs measure fi{dx) and (ii) are functions of the coarse variable ^{x). Prop¬ 
erty (i) ensures the space has a geometry that allows an easy formulation of the concept of 
projections for functions, e.g. it is a Hilbert space. The later property (ii) is called a ” (sub-) 
a algebra” in mathematics in the context of conditional expectations, see Appendix for 
further information. 

Lemma 1 For a given h G the minimization problem 

mf£(G;/!)=mfEq||/!-G«)|l"] , 

(jr Cz 

where inf is taken over all G G L^(/i;^) has the unique solution 

F{z) =E^[h|^ = ; 2 ], G 

Furthermore, 

C{G- h) = C{F- h) + [||F(0 - G(OI|'] for any G G 0 • (9) 

A geometric description of this result is shown in Figure In practice the projection of h 
is performed on a subset of L^(/i;^), the space of feasible observables 

that is the collection of all proposed CG force helds G{z), see Figure The set S may 
consist of non-parametrized or parametrized elements, i.e., a set of splines, the span of a 
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h=h(x) 



FIG. 1. Projection (1) of any microscopic observable h{x) on the space of all CG observables 
G(^(x)), is given by the conditional expectation E^[/i|^]. 


truncated basis of etc. When the minimization problem is over £ the solution is 

not necessarily the conditional expectation dehned by relation Q, as the Lemma 

states since it is possible that E^[h|^] ^ £, rather it is a G* G for which relation ([^ holds, 
see the schematic in Figure]^ In this case we say that G*{z), the projection of h{x) on 
£, is a best approximation of the F{z) = E[h|^]. With the following theorem we state a 
necessary and sufficient condition that the observed quantity h{x) should satisfy so that the 
mean force (|^ is best approximated with a force matching method. 

Theorem 2 (Force matching) Let h G L^(/i) such that 

E^[h\z] = z G , (10) 


then for £ C 

a) g £, 

mf £(G; h) = C{F^^^-h), 

b) if ^ £ the minimizer G* e £ o/£(G; h) satisfies 


C{G*-h) 


\ni^C{G-h)=C{F 


PMF. 


= C{F 


PMF. 


h) + inf E 
Ge£ ‘ 



h) + E^ 


^PMF 


'’“'■({) - G'(e) 
K)-G-K)f’ 


2 
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FIG. 2. Geometric representation of the force matching procedure. Projection (2) of the observable 
h{x) over the set of feasible coarse observables £. 


Proof: Is a consequence of Lemma 

The result of Theoremj^in combination with Lemma[^states that (10) is a necessary and 
sufficient condition for h such that the corresponding force matching optimization problem 
has optimal, or near optimal, solution the mean force It is thus evident that the 

force matching method calculates exactly the mean force if the G £1. In the case that 

pPMF ^ g the force matching method is best approximating the mean force in the sense that 
it calculates a G*, the best approximation of jg element of S closest to 


F 


PMF 


m 


the distance, see the schematic at Figure 2. The error done is exactly 


E 






2 


The h{x) that satisfies ( [Io| is called local mean force, as in the thermodynamic integration 
theory. Note that Theoremsuggests that h{x) should have a specific form which is not at 
all obvious. The purpose of the following section is to provide closed form representations 
for the local mean force h{x). 

Summarizing, the starting point, and overall goal, of a force matching method is to find 
^PMF(^) ^ fixed coarse graining map ^{x). Ideally could be calculated by solving 

a least squares problem of the form infc E[||FP'^p — G|p] over a set S of CG models, for 
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which the optimal solution is obviously a best approximation of But although 

^PMF(^) jg gxed it is not known, thus there is a need for using a computable quantity h{x) 
instead of in the minimization problem that will still has solution or a 


best approximation G*{z). Therefore we construct an h{x) that satisfies (10), for which 
there exist many possible choices as is proved in the following section. The error in a force 
matching method has two sources, first when the projection on of the observed 

quantity h{x) is not the 

and second when the set of proposed CG forces £ does not include 




;iPMF 


(0-G*(e)| 


IV. CONSTRUCTION OF THE LOCAL MEAN FORCE AND 
SYSTEMATIC FORCE MATCHING 

In this section we give a closed form of the local mean force h{x), appearing in the force 
matching problem (|^ for which the mean force is best approximated, based on the statement 
of Theorem and results from thermodynamic integration theory (TIjPSlESl. We introduce 
the derived form of h{x) as the appropriate observable to be used in a force matching 
method implementation in order to best approximate the mean force. In thermodynamic 
integration the goal is to calculate free energy differences for a given reaction coordinate 
using the derivative of the free energy, see Chapter 3 irP^. We think of the coarse grained 
variable .^(x) as a reaction coordinate, even though in the later case one does not necessarily 
consider coarse graining of the system. Then we use the result that the derivative of free 
energy (the mean force) is given as the conditional expectation on of a local mean force 
that has a specific form, a result that we state and prove here for completeness. 

Before we state the result we introduce some notations and assumptions. We denote 
mx) the 3M X 3N matrix with block elements (D^)jj(x) = i = 1,... ,M,j = 

1,... ,N and J^(x) = D^(x)D^*(x) the Jacobian matrix of the transformation. For a matrix 
A, A*' denotes its transpose, detA the determinant and A~^ its inverse. We assume that the 
map ^ is smooth and such that 

rank (D,^) = 3M . 


12 





This assumption ensures that the Jacobian matrix of the transformation J^(x) is non¬ 
degenerate, i.e. detJ,^(a;) 7 ^ 0 and its inverse exists. 

The assumption in Theorem]^ is that h{x) must satisfy 

which, as the following Theorem states, is not unique rather it is parametrized by a family 
of vector valued functions W : —)■ related to the coarse graining map. 

Theorem 3 Given the CG mapping ^ —)■ and the microscopic forces fj{x) = 

-V,^U{x), j = l,...,N,zf 

hw{x) = G^\x)W{x)f{x) + ■ G^'(a;)W(a;), (11) 

where W{x) : ^ ^smxstv is any smooth function such that 

Gw{x)=W{x)Be{x) 


is invertible, then 

F™^{z) =E^[hw\z] . 

The above theorem states that the choice of the local mean force h{x), that is how we 
construct the total force for each CG particle that corresponds to the PMF, is not unique, 
nevertheless the PMF is well defined. For different choices of W(a;) we can consider vari¬ 
ous force matching minimization problems, however, the corresponding PMF is the same. 
Furthermore some of the problems may be better than others, simpler and cheaper to im¬ 
plement. At first glance formula ( pT| seems complicated though a suitable choice of W(a;) 
can introduce major simplihcations. Note also that in the low temperature regime, where 
1/(3 1, term ■ Gjj/(a;)W(a:) is not contributing significantly and can be neglected. 

A W(x) always exists, at least in the case of smooth coarse graining map that we consider, 
since choosing W(a;) = D,^(a;) we have that Gw{x) = is invertible. In thermodynamic 

integration a well t studied choice is W(x) = D^(a;)ES12HI^ that we present in the sequel as a 
corollary of Theorem 

Corollary 1 IfW{x) = D^(x) and rank(D^) = 3M then 

h{x) = 3 r\x)Bax)f{x) + ■ jr'(a:)De(x), (12) 
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where J^(a:) = D^(x)D^*(x), and 


F^^^iz) = [h\z] . 


Note that the second term in (12) depends on the cnrvature V^; ■ J.^“^(x)D,^(x) of the snb- 
manifold f2(2;) = {x : ^(x) = z}. The coarse graining maps that are mainly considered in the 
eqnilibrinm parametrization methods, the force matching the relative entropy minimization, 
are linear mappings ^ —)■ 

N 

^i(^) ^ ^ Cij^ji Cij ^ * 1 ) • • • ) ^ 1 

i=i 


for which the corresponding cnrvature ^(x)D,^(x) = 0, since D,^(x) = T, where 

T = [Cpl3]*=i,.. is independent of x, I 3 denotes the 3x3 identity matrix. The form 

of the local mean force is thus simplihed given in the following corollary. 


Corollary 2 If the CG mapping f —)• is linear with matrix T and for any matrix 

W : —)■ such that the matrix 


Gvk = 


is invertible, then for 


holds 


Furthermore for W = T and 


where = TT^ holds 


hw{x) = (WT*)-iW/(x), 
F^^\z) = E^ [h\z\ . 
h{x) = Jr'T/(x), 
F^^^{z) = E^ [h\z] . 


The result of this corollary gives a compact and simpler presentation and proof of the form 
for the coarse-grained force held as described in worlP^. It provides a way to correctly 
calculate the total force for each CG particle from the microscopic forces corresponding to 
the given coarse graining map. In a force matching method what is often used is the total 
force acting on each CG particle as the observable quantity, i.e. for coarsening to the center 
of mass of K particles h{x) = fji^) where fj{x) is the total force acting on particle 
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j. Let us consider generally h{x) of the form h{x) = Bf{x) for a given 3M x 3N matrix 
B. The question that arises is whether with this observable we approximate the mean force 
associated to the specihc coarse graining. In view of the result of Corollary the question 
actually is whether there exists a W such that 

Bf{x) = (WT*)-^W/(a;), for all f{x). 

Therefore we are looking for a W such that 

W(l3iV-T'B) =03 Mx37V, (13) 

where O^mxsn is the 3M x 3N matrix with zero entries. The above system of equations 
has non-trivial solution, i.e., a non-zero matrix W, if B is such that l 3 Ar — T*B is a singular 
matrix. 

In the following section we study representative examples of molecular systems and coarse 
graining mappings and show in detail the application of the results of the current section, 
as a means of correctly calculating the CG transformation of the microscopic forces at the 
implementation of a force matching problem. 


V. FORCE MATCHING FORMULATION FOR LINEAR AND 
NON-LINEAR CG MAPS 

The subject of this section is to present analytically the form of the local mean force 
h{x) for specihc examples of molecular systems and for linear and nonlinear CG mappings. 
Based on the result of Theorem]^ we hnd h{x) appearing in a force matching problem, i.e., 
when 

CiG;h)=E, [\h-G{On 

is minimized over G G T, and the optimal solution is a best approximation of the PMF. We 
consider two cases in each example, hrstly choose a W(a;) and construct h{x) and secondly 
we accept that the form of h{x) is given and check whether satishes the force matching 
condition (0, i.e. investigate whether there exists a W(x) appearing in ([IT|. 


15 



FIG. 3. Coarsening a many particle system to one CG particle, the center of mass of the N particles. 

A. A-particle system under linear coarse graining maps 


Let us consider a microscopic system of N particles with masses mj, j = 1,..., A and 
position vectors x = {xi,... ,xn) G In the following sections we consider different 

linear coarse graining maps for which we derive explicit forms of the local mean force 
h{x). Dehne the linear mapping ^ —)■ by 


N 

= X] CijXj e i = , 

i=i 

for Qj G M such that Cij = 1 ^ = 1, ..., A. The corresponding matrix is the 

3M X 3A matrix T = [Tjd where Tj,- are the 3x3 blocks 


Tjj — CpIs 


0 , 0 0 
0 0 , 0 
0 0 Cn 


and I 3 denotes the 3x3 identity matrix. 


(14) 


1. Center of mass of N particles 

In this example the coarse grained variable is the center of mass of the A particles, see 
Figure That is M = 1 and the elements of the 3 x 3A coarse graining mapping matrix T 
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are Ty = Cijls) j = as in (14), with 


N 


Cij = —, j = 1, • • •, where m = Y] 


m 


'j ’ 


j=i 


N 


such that 

^{x) = '-^Xj e . 
i=i 

We distinguish two cases, the hrst choosing a specihc W and looking for the form of h{x) 
and the second by choosing a local mean force h{x) and looking for the existence of W such 
that the force matching indeed approximates the PMF. 

a. If we choose W(a;) = T then the local mean force h{x) is given by 

_ 2 W _ W 

h{x) = ^ ^ 


EI\ 

1 = 1 


ml 


E iv 

1=1^’ 

^ ind( 

h{x) = jr'T/(x), 


[X 


j=i j=i 

This is a result of the application of Corollary indeed we have that 


where 


Thus 


N 


J5 = TT‘ = — 


i=i 


h{x) = 


— 9 

m 


N 


N 


Ely 
1=1 




m 


E N 2 / ^ 
1=1 < j=i 




b. We look for W such that the local mean force is the total force exerted at the center 
of mass hw{x) = Yl!j=i Note that hw{x) is nonzero if external forces are present. In 

view of relation ( 13 ) such a 3 x 3A^ W exists if I 37 V — T*B is singular where B is the 3 x 3iV 

for which holds hw[x) = B/(x). We have that 


matrix B = 


I 3 I 3 


I 3 


det(l3Ar - T'B) = 


(1 — Cll)l 3 —C11I3 • • • —C11I3 

— C12I3 (1 — Cl 2 )l 3 • ■ • —C12I3 

~Clvl3 ~Clwl3 • ■ • (1 “ Ciw)l3 


Since we assume that Cij = 1) the sum of all column elements is zero and det(l 3 Ar — 
T^B) = 0. Thus there are inhnitely many nontrivial solutions of W(I — T*B) = O^mxsn 
W = [tciil 3 ,..., tciArl 3 ], wij G M,j = l,...,iV. For example one solution is given for 


Wij = 1, j = 1,N, that is matrix W in Corollary is W 
hwix) = Y.f=ifjix)- 


I 3 I 3 


, for which 
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FIG. 4. Coarsening a many particle system to two CG particles. 


2. Two CG particles coarse space 

We consider an example where the coarse space consists of M = 2 CG particles, Figure]^ 
and the corresponding coarse graining map is dehned by 

N N 


i=i 




with corresponding matrix 


T = 


Til • • • Tiat 
T21 . . . T 2 Ar 


5 T^o i 1,2, j 1, 


W. 


-1 r 


a. Let W = T, then 

\ i j \ j 

Fnrthermore if each particle is contributing only to one CG particle, 

Cij 


Ei Clj El - Ei C11C21 El C 2 jfj{x) 

Ei Cii El C2jfj{x) - CijC2i El 


h{x) = 




Indeed, applying Corollary 1^ we have that 


( 16 ) 


rprpt _ 


Ei Cljis Ei CiiC2ii3 
E7CliC2il3 E,Clil3 
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(TT*) ^ = 


det(TT* 


Cijh - CijC2il3 
~Y.j Cij C2j I3 Y,j Cij I3 


where det(TT*) = ^ Cy E,ci, — CijC 2 j )^5 and the local mean force is given by 


h{x) = 


det(TT* 


E,- Cl J2j Cij fji^) - E,- CijC2i E,- C2jfj{x) 


T.j Cfj Ei C2jfj{x) - J2j CijC2j Ei Cijfji^) 

If we consider that each particle is contributing only to one CG particle, i.e. CijGj = 0, the 
form of h{x) is simplihed, since 

1 


TT*) ^ = 


0 


0 


and becomes 


h{x) = 


E, 


C2j 




Note that when Qj = ruj/ YC ^k, i = 1,2, and Q = {j : Qj 7 ^ 0}, or eqnivalently 

keCi 

Ci = {j ■ particle j contributes to CG particle i}, then from relation (15) we have 


E 


mirrij 


h{x) = 

denoting m, = Yjec, = 1 , 2 , thus 

h{x) = 


y: 


[Ejec^ x:. 


kGCi 

fh2'mj 


fi{x) 

r/j(^) 


E,eCi /i(^) 
EjeC2 /i(^) 


when all particles have equal mass ruj = m, j = 1,... ,N. 


b. 


In this case we look for W such that the local mean force is 


hw{x) = 




Ejeci fA^) 

Ejec, fjA)_ 

We show that such a W exists if each particle is contributing only to one CG particle, that 


is CijC 27 ' = 0 for all j = 1,..., N. Indeed, following relation (13) we write 


hw{x) = B/(x), B = 


Cii-'-3 • • • 0(;ijvl3 


*^( 21^3 • • • ^C2N^3 


liV* 
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where = 1 if Qj ^ 0 and = 0 if Cij = 0 and calcnlate the det(l 3 Ar — T^B), where 


l3iv-T‘B 


(1 — Cll'^Cii — C21^C2i)l3 — (Cll<^Cl2 + C21<5c22)l3 


“(Cll<5cijv + C2l5c2iv)l3 


-(ClAf<^Cii + C2Ar^C2i)l3 -(CliV<^Ci2 + C2Af<5f22)l3 • • • (1 - CliV<^CiiV ~ C2Af^C2iv)l3 
Using the assnmption that J2j&Ci Cp = lu = I 5 2 and properties of matrix determinants we 
have that 


(1 -%!-%! )I3 (1-5C12-'^C22)I3 (1-<5C,,-5C2^)I3 


det (IsAT—T* B) =det 


-(ClAf<5Cii+C2Af5c2i)l3 -(ClAf%2+C2V'5c22)l3 - ClN^Ci^ - C2 nS(2n)^3 

Thus we see that det(l 3 Ar — T*B) = 0 if 1 — — < 5 ^ 2 ^. = 0 for all j = 1,. .., N, i.e., when CijC 2 j = 0 

that means particle j contributes only to one CG particle. 

Assume now that there exist only one particle k that contributes to both CG particles. Let 
for simplicity choose k = 1, and C 21 = l)C 2 j = 0 , j 7 ^ 1, and Cij 7 ^ 0 for all j = 1, ..., A" s.t. 
Ej Cij = 1 then 


det(l3iv - T'B) 


-I3 O3 

-C 12 I 3 (1 - Cl2)l3 • • • 


O 3 

—C 12 I 3 


_-Clvl3 -ClvU • ■ • (1 - ClAf)l3 
which is nonzero. Thus we found an example of CG map for which there does not exist any W 

E' f'{x) 

s.t. hw{x) = ^ ,Ci = {j: Cij / 0} . This suggests that for this CG map one should 

LEieC 2 

choose a W and then construct the h{x) in order to achieve the PMF approximation with the force 


matching, as is calculated for example in case V A 2 a 


3. Two particles contributing to each CG particle 

With this example we examine the coarse graining where each CG particle is the average of two 
particles position vectors which contribute only to that CG particle, Figure That is, assuming 
that the number of particles N is even, the number of CG particles is M = A/2 and the mapping 
is defined by 

— Ci,2i—lX2i—l T Ci,2iX2ii i — 1) ■ ■ ■ ; i 
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FIG. 5. Coarsening a many particle system with two particles per CG particle. 

for (ij G M, and Cij = 0 if j / 2i — 1, 2i, such that + Ci, 2 i = 1 for all i = 1,..., M. The 

3M X 3N matrix of the linear mapping ^ is 


T = 


Til Ti2 0 0 0 

0 0 T23 T24 0 

0 0 0 . 


. 0 0 

. 0 0 

Ti,2i-i Ti^2i 0 : 

0 Tmn-1 Tmtv 


where Tij = Qjls, i = j = I,..., 2M. 

a. Let W = T, applying Corollary we have that 


= 72 -r 72 “ {Ci, 2 i-lf 2 i-l{x) + Ci, 2 if 2 i{x)) , i = 1, . . . , M , 


Ci,2i-1 + Ci, 


2i 

Let TTij denote the mass of the j — th particle and set 

Ci,2*-1 = --, Ci,2i = - j- - i = 

7712,-1 + rn2i rn2i-i + m2i 

then if m 2 i-i = 7772,, i = 1,..., M we can have 

hi{x) = f 2 i-i{x) + f 2 i{x), i = . 

b. We show that there exists a family of 3M x 3N matrices W appearing in Theorem 
such that 


hwA^) = f2i-iix) + f2iix), i = . 
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Let W with block entries tcjjis, i = 1,2,..., M j = 1,... ,N. The later equality holds if, in 
view of W(l3Ar — T*B) = OsMxSN where 


B = 


I 3 I 3 0 0 0 . 0 0 

0 0 I 3 I 3 0 . 0 0 

: I 3 I 3 0 : 

0 0 0 . 0 I 3 I 3 

We have that det(l 3 Ar — T*B) = 0 since Ci,2i-i + Ci,2i = 1 for alH = 1 ,. .., M. Therefore, according 
to (13), there exist infinitely many M x (2M) matrices W such that W(l 3 jv — T*B) = 0, that 


gives 


and 


Wi^ 2 j = Wi, 2 j-i, for all i,j = 1 


,...,M, 


{hwix))i = f 2 i-iix) + f 2 iix), i = l,...,M . 

Note that in all the above examples with linear CG maps the form of h{x) can also be written 
in the form 


N 


d, 


h{x) = -f-fj{x), i = l,...,M , 

j=i 

when appropriately choosing constants dij. This fact proves that our approach reproduces the 
results in Noid et.aP^ and is indeed an extension that holds for any nonlinear CG map, that we 
show with examples in the following section. 

B. Force matching formulation and non-linear CG maps 


In this section we examine the application of the force matching method with examples where 
we consider that the coarse graining mapping corresponds to a reaction coordinate, that is in 
principle a nonlinear mapping ^ —>■ M™. 

We borrow the example from ref.^, where the corresponding free energy differences and PMF 
were calculated explicitly using generalized coordinates. Here we only consider the mapping to 
the reaction coordinate and a proper selection of W(x) appearing in (11), as is also remarked iiP^l 


Section 4.4. In this example the microscopic model is a single molecule consisting of three atoms. 
Let Xj G M^, j = 1, 2, 3 denote the position vectors of the atoms, see Figure 6. 
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FIG. 6. Three atom molecule a) bending angle 9 b) end-to-end vector i. 


1. Bending angle 


The coarse variable is the bending angle 9 =< X1X2X3 , see Figure 6, 




(0,27r), ,f(x) = acos 


< 3^3 - X2,Xi - X2 > 


(16) 


Iks - 3:2111^1 - 3:2II 

where < >, || • || denote the Euclidean inner product and norm in respectively. Applying 

Corollary!^ that is choosing W(x) = D,^(x), the local mean force is 

h{x) = • J^“k 3 :)DC( 3 ;), 

where JC(3;) = D^(a:)D^k 3 ^)- Here D^(a:) = {Vxi^{x),Vx2^{x),Vx3^{x)) G where 

^ sm(^x)) 

and J^(3:) = ||D^(x)|p G M. Thus 

1 


1 ^ <X3-X2,Xi-X2> . 

-[77]-[]-, j = 1,2,3, 

ks - X2 kl “ X2\\ 


h{x) = 


l|De(3;)|| 


^mx)fix) + 


iiDe(3:)ii^ 


rDe(3;: 


where ^{x) is given by ( 16 ). 


2. End to end distanee 

Let us now choose the end to end distance pisH as a coarse variable, 

^ ( 0 ,oo), ^(3;) = Iki - 3 ; 3 || := kisll , ( 17 ) 

for which D^(x)= |ki — 3;3|k^ (xi — 3:3,03,073 — xi) and = 2 . Applying Corollary 

we have 

Hx) = ^D^(x)/(x) ^Vx ■ Qd^(x)^ 

^ < X3 - xi,/ 3 (x) - /i(x) > 3 

2|k3-3:i|| /? ’ 
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since • D^(x) = 6, following the definition (©• 

Remark. A coarse variable that is of interest in molecular systems is the end-to-end vector 
■^13 = — xi G the corresponding map is linear with 

^(x) = X3 - xi := £i3 , 

The mapping has the 3x9 corresponding matrix 

-10 0000100 
T = D^= 0 -1 0 000010 

0 0 -1 000001 
Since the mapping is linear we can apply Corollary which gives 

Kx) = ^D^(x)/(x) = ^ {h{x) - fi{x)) . 

VI. FORCE MATCHING AND INFORMATION-BASED PROJECTIONS 

In this section we show there is a strong link between coarse-graining viewed as minimization 
of relative entropy and CG derived from force matching optimization principle in presented 


in Sections III and IV We first start the discussion with a brief outline of the relative entropy 


minimization and continue with its the relation with force matching. Finally, we include a brief 
description of structural based methods in order to provide a complete view of the methods for 
potential of mean force approximations in coarse graining. 

A. Relative entropy 


The relative entropy approachP^ considers the minimization of the relative entropy functional 

minT^ (/i II/io) = minEo log— (18) 

U&V U&V I 

over a space V = {U\U : —)> M} of interaction potentials. If the CG potentials are parametrized 

with 9 £ Q the minimization is considered over the parameter space 0. The minimization problem 
is based on the properties of the relative entropy a) TZ {/j. \ \ n) > 0 for all probability measures n, vr 
and b)7^ (/x 11 vr) = 0 if and only = n. 

The relative entropy TZ fifj) is a pseudo-distance between the microscopic Gibbs measure 
fi{x) oc and a back-mapping of the proposed Gibbs measure at the CG space oc 
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= ^ijj{z)v{x\z ), 


associated with the proposed interaction potential U{z) where 


v{x\z) 


l{x) 

Z^{z) 


with 7 any nonnegative function, 


(19) 


and 

Z^(z) = E[ 7 |z] = / 'y{x)dx, / v{x\z)dx = 1. 

Jq{z) Jq{z) 

Recall that = {x £ : ^(x) = z}. The measure iy{x\z) is a normalized conditional 

probability of sampling an atomistic conhguration x given a CG conhguration z (microscopic 
reconstruction). A mathematical formulation of microscopic reconstruction is presented in our 
worlpSl while probabilistic reconstruction methodologies are proposed and tested ijJSISESHlSI^ 

The difference in relative entropy between /i(x) and fJ-(j{x) is written 


='^ id'll 1^0) + J T^idi-lz)lliyi-lz))fi{dz) ( 20 ) 

where p,{z) = E[^|z] is the exact coarse grained measure Q, and fi{x\z) is the unique measure 
^{x\z), i.e., such that fj,{dx) = j2{dz)fi{dx\z) . Relation (20) shows that the difference is composed 
from two parts a) the error in the approximation of the exact Gibbs measure fl{z) corresponding 
to the f7™^(z) by fljy, IZ {jl \\ fijy) , and b) the error in reconstruction, f 7Z (fi(-lz) \\iy{-\z)) fL{dz), 
that is the error in approximating fJ-{x\z) by i'{x\z). 

In the relative entropy minimization method, as defined by Shell et.al.l^*^ lix) = 1 assigning 
the same probability to all atomistic configurations x that map to the same z. The reconstruction 
measure is the uniform distribution ^{xlz) = l/|n( 2 ;)|, where |fl(2;)| is the volume of the set ^(z), 
and the error introduced is 


7^ (//(-jz) 11 i^(-|z)) = log |fl| + J fj,{x\z) log fj,{x\z)dx\z . 

Note that for this choice of reconstruction the error does not depend on the proposed approximating 
potential U{z), the error is constant for any JJiz) £ V. In the ideal case where 7 (x) = fi{x) the 
reconstruction is considered exact, there is no reconstruction error since z^(x|z) = n{x\z) and 
TZ ifJ-i'lz) 11 1 'i'lz)) = 0, and the minimization problem is equivalent to minf/gy TZ{fi\ \ fiij). 

In view of the last two observations it is verified that the relative entropy minimization method, 
with uniform or exact reconstruction, is indeed approximating the potential of mean force (/^^^(z) 
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since the minimization problem minp^gy 7?, (/r || is equivalent to the 

min7^(/i||/xp) . 
u&v 


B. Relative entropy and Force matching 


The goal of the last part of this section is to compare the force matching method with the 
relative entropy minimization method. The common point of both methods is their relation to the 
PMF. The relative entropy is directly related with the PMF through relation ( |20[ ) while the force 
matching method at equilibrium approximates the PMF if, as stated in Theorem the local mean 
force h{x) is such that [h\z]. 

As discussed in the previous section, a reasonable choice for the reconstruction is 7 (x) = /u(x), 
the equilibrium Gibbs measur^lSl^ thus 


Z^{z) = n{z). 


Practically this choice of 7 (x) means that we sample from the Gibbs measure using constraints on 
z. One can easily check that the relative entropy Tl{fJ-\ \ lijj) for 7 (x) = ^{x) is rewritten as 




log- 


/i(z) 


= 7^(/i||/ip) . 


Based on the above equality and the properties of the relative entropy we can see that the minimum 
value of 7^(/u||/xp-) is given when /Up(z) = /u(z) corresponding to the PMF under the 

assumption that the reconstruction probability u{x\z) is exact, i.e., TZ {fj,{-\z) || i^{-\z)) = 0. 

With the following theorem we compare the relative entropy minimization and the force match¬ 
ing methods under the assumptions that both approximate the PMF in the sense discussed 

in the previous sections, i.e. = E^ [h\z] in force matching and 7 (x) = /i(x) in relative 

entropy minimization. 


Theorem 4 (Relative entropy and force matching at equilibrium) 

Consider a microscopic system in at equilibrium, characterized by the interaction potential 
U{x) and the Gibbs measure fi{x). Let ^ —)■ be a CG mapping, U{z) ^ V be a family 

of interaction potentials on the coarse space with Gibbs measure fify{dz) and h : 
such that = K^[h\z]. Let tijj{x) = Jiiy{z)pL{x\z) where p.{x\z) = ^(x|z). Consider the 

following two minimization problems at equilibrium 

min 7^ (/i II/ip) , (Relative entropy), 
u&v 
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and 


min£(G;/i) = minE^ ||/i —G(^)|| , (Force matching), 

G^S G^S L J 

where V = {tJ : ^ M}, £ = {G G g ^ -VU{z), U £V}. 

Then the leading term at the relative entropy approach is the square of the potential difference 

(j;PMF _ ^ ^ _ j;PMF|3j^ ^ (21) 


min TZ (pL \ \ pLjj) = min En 
c/ev uev 


where 0{g) denotes a quantity bounded by g, and the force matching minimizes the square of the 
potential gradients difference 


min£(G;/i) = ; h) +minEn [HV - G) fl . 

Ge£ ueV 


( 22 ) 


Proof: The relative entropy functional TZigW gty) (18) with 7 (x) = pL{x) in (19) becomes 


'^{hWhu) 


log — 


g{x )log - 


H{x) 


Pij{z)n{x\z) 
fl{z 


d{x\z)dz = j I pi{x) log \ \ d{x\z)dz 


gu{z)p{x) 


pL{x)d{x\z)\og- — Y^dz = / fi{z)log — 


H{z) 




Tui^) 


dz 


= % 


log- 


fl{z) 




thus, since pi{z) = e /Z and iiiy{z) = e fZ^j, 

n{pi\\gtj) = [-P -U)]+ log Zfj/Z . 

Expanding the logarithm and the exponential in the partition function term, when 
jiPMF(^) _ ij(^z) is small, we get 

^ogZij/Z = ZpjjZ “ 1+2 ~ 1)^ + ^{{ZqIZ — 1)^), and 


Z^/Z — 1 — E^ 




- 1 


1 


= -/3E^ [{U - [(^7 _ 77PMF^2] ^ oifd^Ep [{U - 


Therefore 


log Zjj/Z = -/3E^ [{U - U™^)] + (d^Ep [{U - + Oif^EY, [{U - U™^f ]), 


and the relative entropy is 


7^(/r||/i£;) = /3% 


(^PMF 



+ 0(/3% [|C/-P™^|3]). 
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We contrast this finding with the minimization in C below. For the functional C{G] h) the repre¬ 
sentation holds 

£(G; h) = C{F- h) + /3% [||F(0 - G(Of ] , 

thus under the assumption that F{z) = [h\z\ = and that G G is of the form Gi{z) = 

—'^ziU{z) for [7 G V and the definition of mean force = — V^.C7^^^(2;), i = 1,... ,M we 

have 

£(G; h) = + /3% ||VC7™’"(0 - Vi7(0|f^ 

thus, since E^ [<7’(0] = [4’\ for any observable (j) in we have 

£(G; h) = C{F™^) + /3% ||VC7™’"(z) - VU{z)\\ 

□ 


Observing relations (21) and (22) we notice that the leading term at the relative entropy ap¬ 
proach minimizes the average of the square of potential difference — [7)^, i.e., it is an £^(/u) 

error. On the other hand, the force matching minimizes the average of ||V ([7^^^ — U) |p, an 
error, where H^{fl) = {g G ■ weak first derivatives Dg G Thus, assuming the 

minimization problems have unique optimal solutions, and for the relative entropy and 
force matching methods respectively, these solutions differ by a constant. 


C. Structural based parametrization methods. 

This section concerns an alternative family of CG effective potentials given by the structure 
based or correlation based methods such as the inverse Boltzmann, directP and iterativ^^ol^ a,nd the 
inverse Monte Carlo methodd^. Theoretically, if one can compute the n-body correlation function 
g^{z), n < M from the microscopic system simulations then according to the relatiorP^ 

t/PMF(z(fo) = -ilogr(^(")) 

where = (zi ,..., Zn), is the n-body correlation function, the computation of is 

straightforward, and the structural based methods, in principle, can provide exactly the potential 
of the mean force, as is the case of the relative entropy and force matching methods. 

However, the computation of g^{z) is not feasible for large n, and what is in practice used at 


inverse Boltzmann and inverse Monte Carlo methods is the pair correlation 











In homogeneous systems g^‘^\zi,Z 2 ) depends on the relative position between two particles r = 
Iki ~ - 2211 , called the radial distribution function 




'^B{z2,r){zi)f^{x)dx , 


P P J {x:^{x)=z} 

that is the average density of finding the CG particle 1 at a distance r from the particle 2. Moreover, 
all structure based methods rely on Henderson’s uniqueness theorenJ^, which states that for a given 
radial distribution function there is a unique, up to a constant, pair potential v{r) such that 


U™^(z) = 


E 


v[\\Zi - Z4 


The structure based methods with the use of the pair radial distribution function in principle 
are comparable to the force matching and relative entropy when the later ones consider the family 
of proposed potentials, V in Theorem to consist of pair interaction potentials. The numerical 
comparison of all methods for molecular systems under equilibrium and non-equilibrium conditions 
is the subject of the future wori!^. 


VII. DISCUSSION AND CONCLUSIONS 

The main goal of all systematic CG approaches, based on statistical mechanics, is in principle 
to derive effective CG interactions as a numerical approximation of the many-body potential of 
the mean force, which for realistic molecular complex systems cannot be calculated exactly. 

In this work we have presented a general formalism for the development of CG methodologies 
for molecular systems. Below we summarize the main outcomes of the detailed analysis presented 
in the previous sections: 

(a) The probabilistic formalism discussed allows us to define a systematic force matching, as a 
CG minimization problem both for linear and nonlinear CG maps. This probabilistic formulation 
gives a geometric representation of the force matching method, as is schematically depicted in 
Figure]^ (b) A practical outcome of (a) is the connection of force matching with thermodynamic 
integration that provides a way on how to construct a local mean force in order to best approximate 
the potential of mean force with force matching. Specifically, this connection introduces a family of 
corresponding (to the CG map) coarsening transformations of the microscopic forces (local mean 
force) that ensure the best approximation of the PMF. This approach extends the work in ref.^^^^, 
for any nonlinear CG map. (c) GG methods based on relative entropy and force matching are 
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in principle asymptotically equivalent, in the sense of Theorem]^ both for the case of linear and 
nonlinear coarse-graining maps. Furthermore we prove, for linear CG maps in a specific example of 
a system with N molecules, that the (un-weighted) total force exerted at each CG particle satisfies 
the force matching condition when each particle is contributing to a single CG particle, see the 
example I V A 2 b This fact, along with the example of the nonlinear CG map studied in Section V B 


suggest that for complicated linear and nonlinear CG mappings one can use appropriately formula 
0 and achieve the best approximation of PFM with the force matching method. 

Current work concerns the extension of this formalism, following the results in ref.,^, to coarse 
graining in non-equilibrium systems, an important challenge where in principle CG methods fail.®l 
The numerical implementation of the formalism to different complex molecular systemdS! jg also 
the subject of current studies. 


Appendix A: Conditional expectation and coarse graining 


Let be the probability space induced by the random variable X of atomic con¬ 

figuration. 0 is the (T-algebra generated by the random variable X, i.e. it is the collection 
g = {A£ in s.t. X ^{B) = A} , Gonsider the coarse-grained random variable 

^ = ^{X) and define the sub u-algebra of Q, induced by 

g^ = {A ^ ^3M = A} , 

i.e. any function (j) : —)• M that is ^^-measurable is of the form 




Denote fl(z) = {x G R^^ : ^{x) = z}, the sub-manifold of R^^ corresponding to configurations x 
at a fixed value of the coarse grained variable 2 ; G R^^. The conditional expectation with respect 
to g^ is the random variable [</'!?], defined by 


E/, [(j)\C = z] = E^ [(j)\g^] (z) 


1 

fiiz) 



4>{x)fj,{x)dx, 


for any 2 ;, 


and for any ^-measurable (j), with 

fl{z) 

that is the average of </> keeping 2 ; fixed. 



fj.{x)dx , 
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Appendix B: Proofs 


1. Proof of Lemma [T] 


Let F{z) = then for any G & 6 holds 


C{G;h)=E^ 

]\h-G{Of 

= E^ 

= E^ 

]\h-F{Of 

+ E^ 

= E^ 

'\\h-F{Of 

+ E^ 


iih-F(o+F(o-G'(oir 


\\m - G(oir + 2E^ [{h - Fmrio - g(0)] 


\m)-G{or 

= £(F;h)+E^ ||ir(^)_G(Of^ 


Since 


E/. [{h - Fimm - G(e))] = e^ [e^ nh - Fiomc) - cm n] 

= E^ [(P(0-G(0)E^[(/i-i"(0)k]] 

= E^ mo - GiO) (E^ [h\z] - FiO)] = 0 . 


Thus 


and 


inf £(G;h)=£(F;h) 

(j^O 


£(G;h) = £(F;h)+E^ ||ir(^) _ G(0|| 


□ 


2. Proof of Theorem 

Let the sub-manifold Q{z) = {x G : ^(x) = z} of have the co-dimension 3M, i.e., 
dim(M^^) — dim(fl( 2 ;)) = 3M. The 6 measure is defined as follows, for any smooth test function 
(j) : ^ 

[ 5{i{x) - z)(t){x)dx = [ 0(detJO"^^^dEQ(^) , 

jRSiv 

where •* denotes the matrix transpose, det(-) the matrix determinant and denotes the surface 
measure on n(z). Let the mollifier on 

mix) -z)= any e > 0. 
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We have 


M 

V,/,(e(x) - z) = - z)V^^^i{x) 

i=l 

M 

= -J2^M^{x)-z)V,j^i{x), 
i=l 

recalling the notation (D^)jj = 'Vxjii^ i = 1,..., M,j = 1,... ,N, then we can write 

V,,5,(^(x) -z) = -ne^JMx) - z), 


and, since we assume that (x) is invertible, we can write 


WV,<5,(e(x) -z) = -WDC'V,,5,(^(x) - z) 


from which we have 


VMCix) -z) = - (WDC') ^ WV,,,5,(^(x) - 


Taking the limit as e —)• 0, in view of Lemma in Appendix [B| we have 

V,5(e(x) - z) = - {WB^y^ WVxSiax) - z) 
We recall the definition of the potential of mean force ([^, 


U™yz) = log A(^) - i logZ = log^^ ^ , 


which we rewrite as 


^PMF(^) ^ _ 1 iQg / _ zy-PU(x)^ 

P Jr3N 


lx. 


Therefore, in view of the relation (Bl) 


P PKz) J^iN 


5{i{x) - z)e-^^^^Ux 


^.mx) - z)e-^^^-Ux = [ VM^x) - 

P p{z) Jr3N P ^{z) e^o J^3N 


= — i _ ^ lim 
/3 jiyz) e^o 

- lim 


im / [-(WD^*) 

-^^Jr3N L 


x) - z) 


^-pu{x)dx 


-Vx-[{WBy ^W]+/3(WDe') ^WVxU{x)]e-f^^^%{yx)-z)da 

[ \p (WBy WVxU(x) - Vx • [{WBy w] 

/3 n{z) J^3N L ^ ^ ^ ^ J 


e-^^(^)<5(^(x) - z)dx 


= E, 


(WD^*) ^WVxU{x)-^Vx-[{WBy 'W]| 


(Bl) 
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Thus we conclude that 


that is 


(WDC*) 'W(-V,C/(x)) + -V,-[(WDe‘) ^W]| 


^PMF(^) 


(WD^*) iw/(x) + ^V,-[(WDe') ^W]| 




□ 


Lemma 5 Let the mollifier on be 


5ei^{x) -z)= ^- 2 ^g| 3 M /2 ^ anye>0 


then for any test function 4> : 


/lR3iV 


5e{f{x) — z)4>{x)dx ^ J 5{f{x) — z)(f{x)dx as e —)• 0 . 


Furthermore, 


xde{f{x) — z)4>{x)dx ^ j V x5{f,{x) — z)4>{x)dx os e —)• 0 . 


Proof: Let the smooth test function (f ■ ^ We have 



z)(j){x)dx 


J]^3N (27re)3W2 




1 

(27re)3W2 



e (l){x)dT.Q(^^)dz . 


Next we define the orthogonal projection onto f2(2;), 


Pn{z) ■ ^^(2:), X f C — Pn(z)X 

then = T^n( 2 ;) © N^Ll{z), where Tg and denote the tangent and normal space to Ll{z) at 

We denote the local coordinates on 12(2:) at (ri(^),... ,T3n-3m{0)^ then 

3M 

X = x{t, r]) = ^(r) + ^ rueiiHr)), 
i=l 

where ^(r) G 12 ( 2 ) and Yl\=i Vi^iiCix)) is its normal conjugate, therefore 

dx = (detJ(T))^^^ drcir/ 

where J(r) = V.^(r)*V.^(r) is the metric induced by the embedding of 12(2). In other words 

(detJ(T))^/^(ir(ir? = • 
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Thus based on the expansion of 5 |^(x) — zj^ around its minimum on fi(z) 

\\i{x) - = ^|D^(x)(x - + 0{\x - z\^), 

where D^(x) is the matrix with elements (D^)jj(x) = dxj^i{x), i = 1,, 3M,j = 1,..., 3N, we 
can write 


/ 


1 


R3iv (27re)^^/^ 

1 r 


e-i-M^^-^\"cj){x)dx = 


e-ilD«W(^-^)lV(x)dx + 0(e) 


(27re)3W2 J^3pf 

{(f>{z) + (f>'{z)ri + 0(|??p)) (detJ(r))^'^^ drdr] + 0(e 


(27re)3A^/2 Jj^sN 

(/>(^(r)) (detJ^(^(r)))“^'^^ (detJ(T))^/^ drdrj + 0(e 


' ^ 2 ( 2 ) 


lQ.{z) 

Furthermore, 


/ (f) (detJ^) dT,^i^\ = / 5{^{x) — z)(j){x)dx as e —)■ 0 . 

Jn(z) Jr3n 


JR3N 

which is proved similarly. 


'^xde{C{x) — z)4>ix)dx ^ J \7xS{^{x) — z)(j){x)dx as e —)■ 0 , 


□ 
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